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Abstract 

Following Feynman's prescription for constructing a path integral rep- 
^ resentation of the propagator of a quantum theory, a short-time approx- 

| imation to the propagator for imaginary time, N = 1 supersymmetric 

quantum mechanics on a compact, even-dimensional Riemannian mani- 
fold is constructed. The path integral is interpreted as the limit of prod- 
ucts of the approximate short-time propagator over a partition of a finite 
time interval. This limit is shown to converge uniformly for finite time 
as kernels, and to admit a version of steepest descent approximation suf- 
ficiently robust that the path integral proof of the Gauss-Bonnet-Chern 
Theorem becomes directly rigorous. 

^ Introduction 

Richard Fcynman [Fey42], building on ideas of Dirac [Dir33], identified the 
short-time approximation of the kernel of the propagator for quantum mechanics 
and, by extension, any quantum theory with the product of the exponential of 
i/h times the classical action and a certain coefficient. The full propagator 
should be the limit of the product of many short-time approximations, which 
can be readily reformulated (sweeping the infinite product of the aforementioned 
■ coefficient into the measure) as an integral over all possible paths (or, in general, 

£S) " field configurations) of the exponential of i/h times the action of the path. This 

is the path integral formulation of a quantum theory. Feynman's path integral 
1 has proven an extremely powerful tool in quantum theory, and, surprisingly, 

£SJ . has proven a powerful heuristic tool in mathematics as well. Adept use of 

path integral reasoning has constructed elegant arguments for ostensibly purely 
mathematical propositions whose proofs otherwise were unknown or required 
deep and quite different arguments [Wit 89, Wit91, SW94, Wit94]. One of the 
simplest is the path integral proofs of various versions of the Atiyah-Singer index 
" theorem, including for twisted versions of the Dirac and DeRham complexes 

(the latter giving both the Gauss-Bonnet-Chern theorem and the Hirzebruch 
signature theorem). 

These heuristic "proofs" are due independently to Alvarez-Gaume [AG83] 
and Friedan and Windey [FW84] based on an approach suggested by Witten 
[Wit82a, Wit82b]. The argument (focusing exclusively on the GBC case for sim- 
plicity) begins with a heuristic identifying on the one hand the path integral for 
imaginary-time N = 1 supersymmetric quantum mechanics (SUSYQM) taken 
over loops with anti-periodic boundary conditions for the fermions with on the 
other hand the supertrace of the kernel of the propagator, i.e. of the heat kernel. 
An argument of McKean and Singer [MS67] shows the supertrace is equal to the 
signed sum of the Betti numbers, independent of the time t. On the other hand, 
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stationary phase, which in the imaginary time case becomes steepest descent, 
applied formally to this path integral equates the small-i limit of the integral 
over loops to the Pfaffian of curvature, recovering the GBC Theorem. 

This approach to GBC represents arguably the simplest use of path integral 
reasoning to arrive at a clearly nontrival purely mathematical conclusion. As 
such it is an important benchmark for a rigorous interpretation of the path 
integral. A rigorous interpretation of a heuristic reasoning tool should offer a 
reasonably direct way to translate heuristic arguments into rigorous proofs. In 
particular, a construction of the path integral for the supersymmetric propagator 
that is robust enough to turn the path integral argument of the GBC Theorem 
into a real proof would represent an incremental step towards understanding 
similar arguments rigorously. This is the goal of the present paper. 

In fact, there is closely-related work in the literature. Bismut [Bis84a] uses 
stochastic techniques with the heat equation to give a proof of the index theorem 
in the spirit of the physics argument. Rogers [Rog87] also uses stochastic tech- 
niques to construct an explicit supersymmetric path integral and reproduce the 
path integral proof of GBC (and, in [Rog92b], the twisted Hirzebruch index the- 
orem, from which follows the full index theorem). Andersson and Driver [AD99], 
use stochastic techniques to construct a version of the bosonic path integral on 
curved space, though not towards the end of proving an index theorem. The 
innovation in this paper is to make rigorous Feynman's original time-slicing pro- 
cedure and to use the resulting path integral to prove the Gauss-Bonnet-Chern 
theorem. 

Feynman was quite clear there were two ambiguities in his formulation; 
namely, the specification of the coefficient and the existence of the limit. The 
latter is a straightforward-enough question, and its answer will take up the bulk 
of the paper. The former is subtle and requires some discussion. 

The path integral formalism suggests the propagator for a quantum theory 
(and by extension the quantum Hamiltonian) is completely determined by the 
action functional plus whatever goes into defining the putative measure on the 
space of world sheets, which one might hope would involve no more than kine- 
matic aspects of the theory. If true this would be quite a remarkable claim, 
because purely classical data would determine the quantum Hamiltonian. In 
the canonical approach quantizing the classical Hamiltonian typically requires 
a choice of how to order products of position and momentum operators, and 
we do not expect the resolution of such operator ordering ambiguities to be 
determined by the classical theory for all observables. The hope would be that 
some universal procedure picks out, based on the classical data for a theory, the 
particular choice of operator ordering for the quantum Hamiltonian that nature 
prefers. 

From the Feynman integral point of view this choice would appear in the 
coefficient discussed above. More specifically, Feynman proposed approximating 
the kernel of the propagator by 

where x and y are position variables in the initial and final states and S is 
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the classical action. To have a hope of converging properly this approximate 
kernel must agree with the kernel of the propagator to first order in t, which 
means that its derivative at t = should give the quantum Hamiltonian op- 
erator. The situation already becomes subtle in bosonic quantum mechanics 
on curved space. There the classical Hamiltonian is pig lj (x)pj, so the quan- 
tum Hamiltonian should be the Laplace-Beltrami operator plus terms resolving 
operator-ordering ambiguities. These would involve one or no spatial derivatives 
as well as traces of the curvature and higher order terms in the metric. 

There are several ways to chose C and thus specify a short-time approxima- 
tion to the propagator. Geometric quantization [Woo92] (specifically the BKS 
pairing) builds C out of what is essentially a Jacobian factor for the symplectic 
form [DeW57], yielding the Van Vleck determinant. Assuming the Laplace- 
Beltrami operator is the exact quantum Hamiltonian, the WKB approximation 
gives the same C (in the normalization of this paper, C = e Rlcc ^ yx,yx '/ 12 to the 
appropriate order). The presence of this term in the approximate kernel proves 
to be essential to get appropriate convergence to the kernel of a quantum Hamil- 
tonian which is the Laplace-Beltrami operator plus t/6, where r is the scalar 
curvature. Various other regularizations of the path integral yield Laplace- 
Beltrami plus other multiples of the scalar curvature [Ful96]. Aesthetic con- 
siderations lead mathematicians to hope and expect that the Laplace-Beltrami 
operator is the "correct" Hamiltonian on the nose. There is no realistic hope 
of determining which of these nature prefers, as it seems unlikely the difference 
between these Hamiltonians could be accessible to any foreseeable experiment 
[Sch81]. 

In the presence of all this confusion, the authors have abandoned the effort 
to offer a coherent procedure for choosing C based only on the action and the 
kinematics of the theory, and will choose it based on the desired Hamiltonian 
(which in the case at hand is the Laplace-de Rham operator on forms). That 
is, we choose C so that Ce~ s ^ x ' y ' t ^ is a solution to the Laplace-de Rham heat 
equation to zeroth order in t (and has total integral 1 to zeroth order). Inspec- 
tion of the resulting approximate kernel in Eq. (3.1) shows the Ricci curvature 
term corresponding to the Van Vleck term in the bosonic case, and a scalar 
curvature term which resolves operator-ordering ambiguities coming from both 
the bosonic piece and the four fermi term. 

One final technical point is in order. In general the metaplectic correction 
suggests replacing the naive notion of the Hilbcrt space in quantum theory, 
which is to say functions on position space, with half-densities or half-forms 
on this space [Woo92]. In the bosonic case this appears in the Van Vleck de- 
terminant, which is naturally viewed as a half-density on the product space. 
For supersymmctric quantum mechanics the naive Hilbert space is forms on 
the manifold, which corresponds (see Section 2) to the space of functions of 
a Grassman variable. Because bosonic and Grasmannian determinants cancel, 
half densities on this space can be naturally identified with the naive space of 
functions. The upshot of this observation is that the remainder of the paper 
will ignore the metaplectic correction and half-densities. 
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Technical Introduction 

The principal result of this paper is Theorem 6.1 which shows that the large 
partition limit of the kernel K defined in Eq. (3.1) converges to the heat kernel. 
Arguably the large partition limit captures the essence of the Feynman path 
integral while K captures the small-i approximation of the integrand in the path 
integral representation of the SUSY QM propagator for imaginary time, so the 
theorem may be viewed as a rigorous construction of the path integral for the 
imaginary-time propagator. Moreover, Eq. (6.2) and its special case Eq. (6.3), 
which relate the large-partition limit explicitly to the discretized path integral, 
provide a precise statement of the steepest-descent approximation to the path 
integral. The Gauss-Bonnet-Chern Theorem (Thm. 7.1) follows directly from 
Eq. (6.3), but only because the error term in this equation, expressing the 
difference between the path integral and its steepest-descent approximation, is 
bounded in a norm calibrated to send the supertrace of this error term to zero. 

The norm is somewhat complicated because it serves two masters: On the 
one hand, the error terms in the large partition limit of K must be bounded in 
this norm, which means it must be designed so that, with errors measured in 
this norm, K almost satisfies the heat equation, the norm is almost multiplica- 
tive under *, and K is almost a contraction, as expressed in Eqs. (4.8), (4.7), 
and (4.6), respectively. On the other hand, as above, it must ensure that the 
supertrace of the error terms in Eq. (6.3) must go to zero in order for Theo- 
rem 7.1 to hold. More explicilty, Eq. (4.8) requires a bound on error terms of 
the form of polynomials times Gaussians, which become more sharply peaked as 
t approaches 0. The norm compares these terms near the diagonal to Gaussians 
and away from the diagonal to powers of t (in fact t 1 suffices) times a constant. 
This would completely suffice were it not for the second consideration; in fact, 
Eqs. (4.6) and (4.7) could be strengthened and simplified. However, to get the 
behavior under the supertrace that leads to Gauss-Bonnet-Chern not only must 
the error terms be bounded by t to a power greater than 1, but the norm must 
take into account that these errors are forms; ultimately, the top degree piece of 
the error must be bounded by a power of t greater than n/2. This would suggest 
choosing the norm so that a multiform of highest degree k and constant in t has 
norm t -1 / 2 to the power max(0, k — 2). Such a norm still works for Eqs. (4.8) 
and (4.7), but, unfortunately, just barely fails for Eq. (4.6). So amend the norm 
again and instead of i -1 / 2 use t~ 1 / 2 + e / n for a sufficiently small e. This somewhat 
unintuitive choice satisfies all the conditions. 

The structure of the paper is as follows. Section 1 reviews standard re- 
sults from differential geometry and Riemann normal coordinates, in particular 
describing approximate solutions to generalized heat equations. This will all 
be used towards the proof of Eq. (4.8). Section 2 reviews standard facts and 
notation of Grassman variables, and defines the norm used in the proofs of 
Eq. (4.8) and of Eqs. (4.6) and (4.7) (as well as of Theorem 7.1). Section 3 
proves Eq. (3.5), from which Eq. (4.8) follows easily. Sections 4 and 5 prove 
Eqs. (4.6) and (4.7), as well as Eq. (4.9), which follows from Eq. (3.5). Sections 
6 and 7 use only Eqs. (4.6), (4.7), and (4.9) to prove Theorem 6.1 showing that 
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the large partition limit converges to the propagator and providing bounds on 
the error. Finally Section 8 proves the Gauss-Bonnet Chern Theorem, using 
Theorem 6.1 and the property of the norm with regard to the supertrace. 



1 Riemann Normal Coordinates and Kernels 

The following material on Riemann normal coordinates is standard and can be 
found for example in [Bcr], and follows straightforwardly from [BGV04]. 

Lemma 1.1 is used only in the proof of Proposition 4.1 and is elementary. 
Lemma 1.2 gives approximate fundamental solutions for some generalized heat 
equations in Riemann normal coordinates; it is used in the proof of Proposi- 
tion 3.1 and is purely computational. 

1.1 Riemann Normal Coordinates 

Let M be a fixed compact oriented Riemannian manifold of even dimension 
n. Since M is compact there is an injectivity radius such that for all y £ M the 
exponential map exp y : T y M — > M is invertible on points within the injectivity 
radius of y (with respect to the Riemann distance) . Refer to the inverse image of 
x G M within the injectivity radius as x y <G T y M (if the subscript is obvious omit 
it). A choice of orthonormal coordinates (<9i)™ = i on T y M then gives coordinates 
on a patch around M which are called Riemann normal coordinates. In such co- 
ordinates express the metric 3ij(x) = (<9j, dj) x , its inverse <7* J '( X ) , the Levi Civita 
connection r^.(x) = g kl (di, V9«^') x an d the Levi Civita parallel transport from 
y to x along the unique geodesic connecting them ^jj(x) = {di,*!($%dj) , in terms 



of the curvature at the origin y by the following formulas 

ffy(x) - % + ^R ikjl (0)x k x l + o(|x| 3 ) (1.1) 

ff «(x) = & - ^'i? fe yO)xV + o(|x| 3 ) (1.2) 

r* (x) = -^$.(0) + R£(0)] x l + o(|x| 2 ) (1.3) 

«Py(x) - b l3 + ^R ikj i(0)x k x l + o(|x| 3 ) (1.4) 



where C^|x| 3 j refers to an unnamed function which is bounded by some con- 
stant multiple of |x| 3 . Here the components of the Riemann curvature at x 
satisfy R[di, dj] d k = i? l j fc 9 / (x) and fiy H (x) = (d h R[di, dj] d k ) x . 
The Ricci and scalar curvatures are 

Ricciij(x) = g kl {x) R kij i{x) (1.5) 
t(x) =^'(x)Ricci ij (x) (1.6) 

Lemma 1.1 Ifx andy are Riemann normal coordinate expressions about some 
point for x and y which are close enough so the distance between them d(x, y) 
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is defined, then 

d(x,y) 2 Hx-y| 2 + 0(|x| 2 |y 2 |). (1-7) 

Proof: Consider the difference /(x, y) = d(x,y) 2 — x — y| 2 as a function of 
x for fixed y. It is smooth, and, since the exponential preserves length along 
geodesies through the origin, /(0,y) = 0, and the directional derivative of / at 
x = in the direction y is zero as well. Gauss's lemma ensures the directional 
derivative at x = in directions perpendicular to y are also 0. So 9//<9x = 
at x = 0. Since this is true for all y, d 2 f/dy 2 and d 3 f/dxdy 2 = at x = 0, as 
well, so d 2 f/dy 2 is bounded by some c |x| 2 for all x and y by Taylor's theorem. 
By symmetry / and df/dy are zero when y = 0, so integrating d 2 f/dy 2 twice 
against y gives the result. □ 



1.2 Heat Equations and Approximate Kernels 

Of particular interest are approximate solutions to various versions of the 
heat equation in Riemann normal coordinates. 

Lemma 1.2 In a neighborhood of in Riemann normal coordinates if 

A = ^(d i d j -T k j d k ) 

K (x;t) = (2nt)- n / 2 e- S - 1 ^ 1+ T2 Ricchlxkx,+ £ 



then 



dK 



0/ -A X = o(i+|x| + |x| 3 /i+|x| 5 /* 2 )^o. (1.8) 

Likewise if /(x) is twice differentiate with f and all its first and second deriva- 
tives bounded by C and 

Ai = A + /(x) 



K 1 ( X ;t)=K ( X ;t)e^+^ 

then 



dt 



Ai#i = Oft + |x| + |x| 3 jt + |x| 5 jt 2 + Ct + C |x| 2 + C 2 t 2 ) . (1.9) 



Finally if h l k is for < k,l < n a collection of functions o/x all bounded by 
D, then defining 

A 2 = Ax + h l k x k d l + h k k /2 



and 



h' k S w x k x l li 
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then 
8K 2 



A 2 K 2 = Oft + |x| + |x| 3 /t + |x| 5 /t 2 + Ct + C |x| 2 

+ C 2 t 2 + D \x\ + D |x| 3 jt + CD \x\ t + D 2 \x\ 2 ^K 2 . (1.10) 



Proof: Here, and in the remainder of this proof, write xj for 5jj'X° . For 
Eq. (1.8), observe that 



dK 
dt 



y~L x x> n 



It 2t 2 12 



and 



9iK = 



dx % 



Xi Riccijfca; fe 
~T + 6 



+ o(t + \ x \ 2 ) 



(1.11) 



(1.12) 



Therefore (implicitly symmetrizing both sides, since this term occurs in A 
only symmetrically) 



didjKo = 



S t j I^icci^j x^Xj 



RiccijkX k Xi 
it 



0(t+|x| + |x| 3 A) 



K 



and 



-r*^ + o(|x|) 



Using Eqs. (1.3) and (1.2) to expand the metric and Christoffel symbol in Ric- 
mann normal coordinates, and dropping low enough order terms 



A K 



g l3 6ij ff^Riccijj g lJ XiXj g t3 Ricci jk x k x l 



2t 



12 



2t 2 



6t 



+ ^f^ + 0(t+|x| + |x| 3 A) 



n t R k l u x k x l i r r x fc x fe Ricci fci x fe x z ; fl"'!^,^ 



-2^ + 



6i 



+ tt: + 



12 2i 2 



6f 



+ 



K 



+ o(t+|x| + |x| 3 /t+|x| 5 A 2 ) 



■^ + ^ + ^ + o(t + W + WV* + M 5 /* a ) 



2i 



X (1.13) 



where we have used Eqs. (1.3) and (1.5) in the last line. Subtracting Eqs. (1.11) 
and (1.13) gives the result. 
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For Eq. (1.9), notice dKi/dt is given by all the terms in Eq. (1.11) multiplied 
by K\ instead of K plus in addition 



/(0) + /(x) 



Also 



frfiUW+Kx)] _ 1 g /( x ) r ^f/(Q)+.ffx)i 
"2 &• 



so estimating 9 l ^e^ [/(0)+/(x)1 as 0(Ct + C 2 i 2 ) and estimating a 4 if ^ef[/(°)+/( x )] 
with the help of Eq. (1.12) we see /S.\K\ contains all the terms in Eq. (1.13) 
(times K\ instead of Kq) plus 



/(x) - 



x* a/(x) 

2 



C>(tC + i 2 C 2 



Noticing that 



/(0) + /(x) 



/(x) 



g/(x) 

2 ax 1 



(C|x| 2 ) 



by Taylor's Theorem gives the result. 

For Eq. (1.10), there are no additional terms to dK 2 /dt beyond those in 
dKi/dt except multiplied by K 2 instead of K\. The additional term in djK 2 is 

-/ 

(writing h jk for Sjf h 3 k ) 



(-\[h jk + h kj ]x k +D\*\ 2 ^K 2 



So the additional terms in didjK 2 are (symmetrizing on i and j) 



t 



htj +o(p |x| + CDt |x| 2 + D 2 |x| 4 ) 



K 2 



which means that the additional terms in A 2 K 2 consist of 

9 t3 h 3k x k x t 1 k x l k 
£ 2 5 ij ~ kl T k ' 

+ o(D |x| + D |x| 3 ft + CD |x| 2 t + D 2 |x| 4 ) 
from which the result follows. 



□ 
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2 Grassman Variables and the Norm 



The first subsection on Grassman variables is completely standard and can be 
found for example in [MQ86, Rog92a]. Expressing operators on forms in terms 
of kernels in Grassman variable is also in [Rog92a]. The norm | • | t is designed 
primarily to keep track of the error terms in the large partition limit in such 
a fashion that the supertrace of the error terms goes to zero as required for 
Gauss-Bonnet-Chern theorem. Apart from various baroque analytic additions, 
the idea of the norm and the rank of terms is a fairly natural scaling wherein the 
spatial variable y x scales as i 1 / 2 and the Grassman variable ip scales as i -1 / 2 . 
Lemma 2.1 collects facts about the norm used elsewhere, but the key fact is the 
first, which uses the presence of the Gaussian to translate powers of the spatial 
variable into powers of t 1 ! 2 near the diagonal, and an exponentially damped 
term away from the diagonal. This is crucial to the proof of Proposition 3.1. 

2.1 Grassman Variables 

If V is a vector space with basis v\ , . . . v n then a fc-linear function of k 
variables Wi — w\ Vj for 1 < i < k takes the form 

/(wi, . ..,w k ) = C juj2 ,... jk w{ 1 w J 2 2 ■ ■ ■ w 3 k k . 
The antisymmetrization of this multilinear function is the element of A*V given 

by 

Af(v lt ...,v k )=Y^ —^—f{w a(1)l . . . , w a{k) ) 

(7 

which is again / if it was already antisymmetric. This map from multilinear 
functions to antisymmetric ones is manifestly basis independent, but using the 
basis gives a simple representation of A*V as the algebra generated freely by 
ip , ip 2 , ■ ■ ■ , ip n subject only to the antisymmetry relation ip J ip J = —Sip^-ip 1 . The 
antisymmetry map is then given naturally by writing ip = ip 1 + ■ ■ ■ + ip n and 
interpreting Af as 

Call such an / an antisymmetric polynomial in the Grassman variable ip with 
values in V. 

To represent endomorphisms on A*V, define maps d/dtpi for 1 < j < n by 

if ip L = ■ ■ ■ %j) lk and j ^ U for all i. It is easy to confirm that multiplication op- 
erators ip and the differentiation operators d / dtp 3 generate all endomorphisms. 
In fact the maps d/dipi generate a free antisymmetric algebra as well, so the 
same convention applies to interpret multilinear functions g{d/dip) of these. 
Expressions of the form f [ip) g{d / dip) define endomorphisms (and in fact span 
all endomorphisms) . Operators that preserve form degree are spanned by those 
products in which / and g are homogeneous of the same monomial degree. Filter 
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the form-degree-preserving endomorphisms by this common monomial degree, 
and call this the degree of the endomorphism. An endomorphism of degree p is 
zero on all elements of A*V of degree less than p and nonzero on at least one 
element of degree p. 

Given a choice of degree n element of A*V, any operator t on A*V determines 
a kernel; that is, an antisymmetric polynomial K(ip x ,ip y ) in two Grassman 
variables ip x , ip y valued in V (which of course anticommutc with each other) . 
Explicitly, applying t to f(ip) is 



where the Berezin integral § f(ip y )dip y returns the coefficient of the chosen top 
element of A*V in /. Such kernels in turn admit a sort of Fourier transform, in 
terms of a Grassman variable p valued in V* and an antisymmetric polynomial 
in three variables k(ip x ,p, tp y ) : 



where the Berezin integral is with respect to the element of AV dual to the 
chosen element of A*V. If k(tp x , p, ip y ) is of the form f{ip x )g(p)h(ip y ), the oper- 
ator f is simply t = f(il>)g(id/dip)h(ip). Degree-preserving endomorphisms are 
spanned by homogeneous k in which the degree of p equals the sum of degrees 
of tp x and ip y , and the degree of the endomorphism is the degree of p. 

2.2 Bundles 

With the help of the mild subterfuge of the previous subsection, sections 
of all bundles in this paper may be expressed using the "function of variables" 
notation familiar from freshman calculus. Usually these sections will also depend 
on a positive real parameter t (or sometimes tj or Tj), which will appear in the 
list of parameters at the end, after a semicolon, to emphasize its special role. 
For the entire paper except Corollary 6.1 this t will be bounded above by an 
arbitrary constant, and all other constants, explicit and implicit, will depend 
on this bound as well as on the manifold M. 

For instance, /(x, ip x ) and f(x, ip x ;t) represent sections of the bundle A*Tm 
over M, i.e. elements of T(A*Tm) , or forms. This notation is meant to indicate 
that at each x, ip x is a Grassman variable taking values in T X M, and f(x, ij) x ) is a 
multilinear function of this variable which thus represents an element of A*T X M. 
One useful aspect of this particular notation is that, assuming the Berezin in- 
tegral uses the measure coming from the metric on T X M, J <f f(x, ip x )dtp x dx 
represents the integral of the top form component of /, regardless of the metric. 
That is, it represents the usual integral on forms. 

The arguments in the paper are concerned with sections of a particular bun- 
dle over (x,y) <E M x M which will be most conveniently discussed in two 
distinct representations. (In fact, the representations only agree near the diag- 
onal, but that suffices) The first bundle is A*T X M ® A*T y M, whose sections in 
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Y{A*T X M) ®T{K*T y M) can be represented by functions L(x,y;ip x ,ip y ) in the 
obvious fashion. Dehne E x>y to be the set of all such sections which are degree 
preserving (that is, the kernel is a sum of monomials in which the degree of tjj x 
and the degree of ip y add to n). As in the previous subsection, such sections 
represent kernels of operators on forms, via the * product 

(lf)(x,tp x ) =[L*f] (x,ip x ) = J <j> L(x,y,ip x ,ipy)f(y,ipy)dip y dy. (2.1) 

Usually the kernels will only be nonzero when x and y are within the injectivity 
radius of each other, so that the parallel transport map from T X M to T y M 
is defined. Naturally associated to this map (via transpose and antisymmetric 
product) is the map A*T y M to A*T X M, also called parallel transport Such 
kernels take the form 

j e l ( p ^y^-^)k(x, y, $ x ,1> y , p)dp. 
The simplest example would be the kernel 

P{x,y,ip x ip y ) = j e l ( p ^*-^) dp 

(P = when ^ is not defined) which is the kernel of the parallel transport 
operator itself, always assumed to be along a minimal geodesic. 

If x and y are sufficiently close, then L G E x ^ y determines a section L G E x>v 
of the bundle End (A*T x M) ® R y defined by 

L(x, y, tp x , i(; y ) = L(x, y)P(x, y, ip x ,ip y ). 

Here the subscript is a reminder that the endomorphisms are degree-preserving, 
and the subscript on R y indicates the bundle is only over the first factor of M. 

2.3 The Norm on the Bundle 

Fix for the rest of the paper an arbitrary e with 1/2 > e > 0. All explicit 
and implicit constants in what follows will depend on e. Define a norm | • | t 
depending on a positive real t on End (T X M) as follows: 

Definition 1 In terms of an orthonormal basis, 



V " ' ^ cV> Zl '"dip 



1 k ~ 2 (22) 

t (fe-2)(-l/2+e/n) fc > 2 . 1 > 



This is the inner product norm times a power of t depending only on the degree. 
If L G E XtV define |L| to be the function of x and y given by this norm on each 
fiber, and for L G E x , y define \L\ t = |L| . 

There is a real-valued kernel H of particular importance, because, where 
defined, it agrees with the flat-space heat operator; namely 

H{ X ,y:t) = (2irt)- n ' 2 e-^ (2.3) 
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if x and y are within the injectivity radius of each other and H = otherwise. 
The errors introduced by using approximate heat kernels on forms will generally 
be i-dependent sections which take the form of H times a section well-behaved 
as t goes to zero. 

Suppose L e E x<y is a i-dependent kernel of degree k, and suppose there 
are real numbers C > 0, r, and s such that \L\ < Ct r \y x \ s for all x and 
y within the injectivity radius. Then define L to be of rank p where p = 
min(2r + s, 2r + s + 2 — k). The sum of terms of rank p of different degrees is 
still called rank p. The same definition applies to elements of E x>y and the rank 
of L is the rank of L. 

For instance Ricci^, y x ) , tt, and t(p, R[ip Xl Tp x ] p) all are rank 2, while 
(p, R[y x ,tp x ]y x ) is rank 3. The significance of rank is explained in the next 
lemma: 

Lemma 2.1 

(a) If L € E x y is of rank p or less and is a smooth function of t times t c , 
where c > — 2, there is an A > such that 

LH = F 1 H + F 2 t (2.4) 

where Fi G E x , y and |F;| t < At ( P-^/ 2+e . 

(b) If L is a smooth function of t\ < t of degree at most 1 and rank 2, 
or degree 2 and rank 2 by virtue of being a multiple of t\ , then there are 
B, D > such that ifF G E x , y 

\LH(t 1 )F\ t < Bt{ /n \F\ t [H(h) + Dti] 

\~FLH{tx)\ t < Bti /n |F| t [JT(ti) + Dti]. (2.5) 

(c) J/t<* / i/ien|F| t , < |F| t . 

(d) T/iere is a C > swc/i i/iai /or a/Z F,G G i?a; >y 

|FG| t < Ci" 1 |F| t |G| t . (2.6) 

Proof: 

(a) It suffices to prove the result for L of rank exactly p, and degree k > 0. 
The proof of Eqs. (2.4) and (2.5), as well as of Eq. (4.2) below, rely 
on the following trick for converting powers of y x into powers of t in 
the presence of the Gaussian H. Choose S, C > 0, and consider sepa- 
rately the case \y x \ > Ct x / 2 ~ s and | < Ct x / 2 ~ s . In the former case 
H{x,y;t) = 0(t~ n / 2 ) e c * I 2 . This is exponentially damped, and thus 
can be bounded by some multiple of any power of t desired; in particular, 
LH = F-zt with F2 bounded as in the statement of Eq. (2.4). 



12 



In the case \y x \ < Ct 1 / 2 s , since L is rank p, it is bounded by a multiple 
of t r \y x \ s where p = min(2r + s, 2r + s + 2 — k). The bound on jy^ implies 
L is bounded by a multiple of f( 2 H-s)/2-.s5_ Thus \L\ is bounded by a 

multiple of t p / 2 ~ sS . By assumption, r > —2, so s is bounded (by p + 4 if 
k < 2 and by p + k + 2 otherwise), and therefore, given l/2>e>0,<5 
can be chosen small enough that in fact \L\ t is bounded by a multiple of 

t (p-l)/2+e_ 

(b) First notice multiplying two elements of End (A*TM) of fixed degree to- 
gether gives a product whose degree is the sum of their degrees, so that 
multiplying a degree k term (smooth in t) times F multiplies the i-norm 
by a multiple of f-fc(V 2 -e/n)_ 

Again the result is easy if |y*| > Ct\ /2 -\ for some C and 5. If instead 
\yx\ < Ct 1 ^ 2 S , then in the case that L is degree at most 1 and rank 
2, its smoothness implies r > so it is bounded by a multiple of t\~ 2S . 
Under multiplication it thus multiplies the i-norm of F by a constant 
times t \- 2 h- x l 2+t l n < t \ /2+e/n ~ 25 , which with the right choice of <5 is 

bounded by a multiple of t]f n . In the case where L is a multiple of t\ 
times a degree 2 term, then upon multiplication it multiplies the t norm 
of F by a multiple of ^(-l^+e/n) < t e/n_ 

(c) Trivial 

(d) Multiplication in End (A*TM) adds degree, and clearly is bounded with 
respect to any fixed norm. If the degrees k and j are both at least 2, then 
the i-norm of the product is a multiple of t(' £ +J _2 )( _1 / 2 + £ /") which is a 
multiple of i 2 (- 1 / 2 + £ /«) times the i-norms of the factors, so in particular 
a multiple of t^ 1 times the product of the i-norms. If cither degree is less 
than 2, the same is true with a higher power of t, and therefore in general 
it is a multiple of t~ x times the product of t- norms. 

□ 



2.4 The supertrace 

The matrix supertrace str of an operator t G Endo(A*U) is the trace of the 
restriction of 6 to even forms minus the trace of its restriction to odd forms. In 
terms of Grassman variables, the kernel of 6 is K(ip x ,i() y ) = Kijip^ipy and the 
trace of its restriction to forms of degree d is, in an orthonormal basis, 

tr t\ d = £ Kjji-lf, 

\i\=d 

where I is formed from the complement of J in {1, . . . , n}, and the last factor 
is the sign of the permutation. This follows from noticing (— l) 11 ^ 1 is the dual 
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basis vector to tp 1 . On the other hand 

* K(ip, ip)dip = Y^ K u f dip 

The integrals on the right vanish, unless J = I, so the sum reduces to J2i Kii{~^) H 
Since (— l) 77 = (— 1)I 7 H 7 I (— l) 77 , decomposing this sum according to degree and 
comparing with the sum for tr t\ d , shows 

K(ip, ip) dip = strt (2.7) 

since n is even. 

Lemma 2.2 If F is the kernel of an operator in Endo( A* V) and \F\ t G 0(t p+1+e ), 
then 

str(F) e 0(t p+n / 2+2e / n ). 

Proof: From the above characterization of the supertrace, F = str(F)-0 1 ■ • ■ ip n -^p- 
plus lower-degree terms. 

The condition on F and the definition in Eq 2.2 of the norm on Endo(A*V) 
thus require 



str(F)t ( ™- 2)( - 1/2+e/n) = str(F) 



from which the lemma immediately follows. □ 



3 The Approximate Kernel 

For background on the heat operator see [BGV04]. Proposition 3.1, which says 
K is an approximate solution to the heat equation with error terms that are 
Gaussians times errors bounded in the i-norm, is entirely computational. 

3.1 The Approximate Kernel 

The small t approximation to the propagator of N = 1 imaginary time 
supersymmetric quantum mechanics in curved space is given by the kernel 



/r tr 1 
H ° XP I ~ ~6 + Y^ Ricci ( x ?" x f) 



+ l ( P ,¥ X y lP x - ^Py) + Rtyy^y] P ) + RbP x ,i> x ] % P ) ] dp (3.1) 

when x and y are close enough to be within the injectivity radius of each other 
(so K is invertiblc in this region) and zero otherwise. All curvature tensors are 
understood to be evaluated at y except for the last R which is evaluated at x. 
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The analytic properties of K crucial in what follows will also hold for any 
kernel K' which differs from K by a term which in Riemann normal coordinates 
around y (that is in terms of x^) is H (x, y; t) times a term of rank 3 (see §2.3). 
Accordingly, write K ~ K' if the two kernels differ by such terms. 

Observe that K = (P + L)H, where P is the parallel transport operator, 
and L is a sum of terms each of which has the form a positive power of t, 
times a power of X,,, times a product of ip x , P, ipy, wherein the sum of the power 
of x y and twice the power of t is at least the degree of the ipp factors. As a 
consequence, L times any term of rank p is again rank p. Since P does not affect 
rank, any rank 3 term times K is equal to some rank 3 term times H. 

For example the Ricci and r terms are of rank 2, and replacing them with 
the same terms interpreted at x (with y x replacing x y ), changes K by terms of 
rank 3 times H (since Ricci and r are both continuously diffcrcntiable) . Thus 
this change produces a kernel equivalent to K under ~ . Likewise, changing 
variables from p to Cp^p (since CP is measure-preserving there is no Jacobian 
factor) shows that, up to H times terms of rank 3, K is symmetric with respect 
to x and y. 

In fact, up to such terms, K has a simple expression in Riemann normal 
coordinates centered at y. Notice 

(p,^ytpx ~ *l>„) = Mx-il> y ) + \ (p,R[x y ,tp x ]x y ) + o(\y x f^ y p^ 

the error term being rank 3. Likewise, in the term with the can be 

removed at the cost of a rank 4 term, and R x can be replaced with R y at the 
cost of a rank 3 term. This leads to 



K 



Hcxp 



t 



tx 1 i 

i (p, ipx-ip v )--Q+ Y^ Ricci ( x y> x v) + g (P, R[ x y, ipx] x a ) 



+ g(P,R[lPx,1px]p) + e(P,R[i>y,1Pv] P) 



dp. 



(3.2) 



That is, defining K' as 
K' = <£ H cxp 



tx 1 i 

i (p, ip x -ipy)--Q+ ^Ricci(x y , Xj,) + - {p, R[x y , ip x ] x y ) 



+ l(p,Rbl>x,1px]p) + l(p,R[lpy,1py] P) 



K - K'. 

3.2 The Heat Operator 

Let A LdR be the operator on forms given by 



dp, 



(3.3) 



1 / d 



A Ld R = -g l3 {x7i Vj -rfj-Vfc) + o ~T,R[il>,il>] TTT 



d 



dip 



(3.4) 



where A f = d t - Y^d/d^. 
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Proposition 3.1 For K = K(x,y,tp x ,tp y ;t) defined by Eq. (3.1) and A by 
Eq. (3.4) 

^ + ^A X K = F x H{x, y; t) + F 2 t, and 

^ + ^A v K = F 3 H(x,y;t)+F 4 t, (3.5) 
where F { 6 E x . y and \Fi\ t = 0(t e ) . 

Proof: 

Recalling Eq. (2.4) of Lemma 2.1, the argument reduces to showing that the 
heat operators on the left-hand sides applied to K give terms of rank 1 times 
H. To this end, first observe that the heat operator acts on H times terms of a 
given rank to produce H times terms whose rank is lower by at most 2. This is 
because the operator d/dt lowers the rank of a smooth function of t by 2, and, 
when applied to H, gives H5ijX° x° /2t , which also lowers the rank of whatever 
it multiplies by 2. In A x , the curvature term does not change rank at all (and is 
zero on H ) . The operator Vi m the other hand, lowers rank by one for smooth 
terms because di lowers the degree of Xj, by 1 and the T^ipi d / dip k actually 
increases rank by 1 (by Eq. (1-3)) Acting on iJ, v« gives —HSijX^/t, which 
lowers the rank of whatever it multiplies by 1. Thus, taken together, the heat 
operator will lower rank by at most two. As a first consequence, if the heat 
operator applied to some K ~ K gives terms of rank 1 times H, then the same 
is true for K itself. This in turn implies the two equations in the proposition are 
equivalent, since, as noted above, K is symmetric up to terms of rank 3 times 
H. 

It suffices, then, to confirm the result on K' ~ K, where K' is defined 
in Riemann normal coordinates by the right-hand side of Eq. (3.3). In fact, 
simplifying the heat operator itself in Riemann normal coordinates, by dropping 
terms that when applied to K 1 give H times terms of rank 1 or higher, will lead 
to an operator A2 and a kernel K2 ~ K' for which, by Eq. (1.10) of Lemma 1.2, 
the proposition holds. 

Begin by expanding ALdR. in Riemann normal coordinates, 

where Ao is defined in Lemma 1.2. The terms omitted, when applied to K', 
give terms of rank 1 or higher times H (each di lowers rank by 1, each factor of 
T increases rank by 1). 
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Thus 

A LdR = -2 [A + h[x k d l + h k k /2 + /(x)] 

where h{ = |j« (r^ + R q p ki ) and / = -\(jfa,R[^\ ^) . The ex- 

prcssion in Riemann normal coordinates denotes a differential operator on R n 
with coefficients in Endo(A*IR n ) , so it still makes sense to refer to the degree 
and rank of a term. Of course, this is a noncommutative algebra, but since 
r 11 " 1 ™ 1 I 2 and e - *^ commute up to terms of rank 4, Eq. (1.10) applies, with 
C of degree 2 and D of degree 1, to conclude 



dK 2 1 



+ -A LdR K 2 = FK 2 



dt 2 
where 

K 2 (x;t) = He^ mcchixixi+ ^~^ RkliixixHk ^~^^ M ^\ 

R = 2 (R X + Ry) ■, and F, representing all the contributions to the error term in 
Eq. (1.10), is of rank 1. 

It remains to show K 2 ~ K', when the first is expressed as a kernel integrated 
over p and the second is given by Eq. (3.3). 

Towards that goal note 

d d d d 

= -R l]kl {xW^ —^—^ + R ijki ( x )^— - R ijkj ( x )ipi — 

d d d d 

= -r^x)^^-^ - RlMy) A-l-^ + 2Rijki{xW JL + 2Rjiu{y) ±.^ 

= -RijuWp — — RlMy) — —^ + 2t(y) + Rank 1. 

Multiplying by —t/8, exponentiating and writing the resulting operator as a 
kernel integrated over p gives 

The rank 1 error term disappears, since the exponential of t times a rank 1 term 
is 1 plus a rank 3 term. Combining this with the other Grassman term from 
K 2 , gives, as an integral over p, 

= (/i/ e i <' 9 .^-^«)-T-S^™3! Km ^V ! ^ + |[(P,^x>x]p)4<P,^B,V' ! /]p)]^ j0 _ 
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Comparing the definition of K 2 above with that of K' in Eq. (3.3) gives K 2 ~ K' . 
□ 



4 The * Product of Kernels and the Norm 

The material in this section relies heavily on §2.2 and §2.3, requires the defi- 
nition Eq. (3.1) of K, and makes one brief reference to Eq. (1.7). Lemmas 4.1 
and 4.2 involve rather technical bounds, and are only necessary for the proofs 
of Propositions 4.1 and 4.2. Only Eqs. (4.6), (4.7), and (4.9) are needed going 
forward for the rest of the paper. Notice the norm || • || t could not be defined un- 
til Lemmas 4.1 and 4.2 guarantee the existence the quantity D in the definition 
so that Eq. (4.6) holds. Once the norm is defined, Corollary 4.1 can be viewed 
as the correct statement of Proposition 3.1. 

4.1 Bounds on K and H 

Recall that L\, L 2 £ E x ^ y represent operators on F(A*TM) and the product 
of operators corresponds to the * product 

L 1 * L 2 (x,z,ip x ,ip z ) = J <j> L 1 (x,y,ip x ,ipy)L 2 (y,z,ip y ,'il) z )dil) y dy. 

Lemma 4.1 For bounded t > 0, and H as in Eq. (2.3), 

f H(x,y;t)dy = l + 0(t); (4.1) 

for bounded ti, t 2 > with t = ti + t 2 , and e as in $2.3, 

H(h) * H(t 2 ) = e°( t;/ ") [H(t) + 0(t 2 )] , 

H(h) * H{t 2 ) - e°(* |/n ) [H(t) + 0(t 2 )] . (4.2) 



Proof: By Eq. (1.1) the measure dx = 1 + o(\x v \ 2 ^J 
from the inner product on T y M, so Eq. (4.1) becomes 



dx y where e?x y comes 



H(x,y;t)dx = / (2irt)- n/2 e 



l + o(| X2/ | 2 )] dx y = l + 0(t). 



For Eq. (4.2), following the t 1 / 2 *-trick in the proof of Lemma 2.1, consider 
first the case where |z x | is greater than some multiple of t 1/>2 ~ s . Then either 

j/2 (5 1/2 5 

\x y \ is more than a multiple of or |y z | is more than a multiple of t 2 

so that the entire integral 



H(h) * H(t 2 )(x, z) = (2nt 1 )- n / 2 (2nt 2 ) 



i/2 



2*2 dy 
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is bounded by a multiple of t 2 times the integral of a multiple of H(x,y;t), 
which by Eq. (4.1) is bounded by a multiple of t 2 . 

If \z x \ is less than a multiple of t 1 / 2 ^ 5 , choose the multiple small enough first 
of all that x and z are within the injectivity radius of each other. Define 

E(x, y, z; a) = (1 - a) \x y \ 2 + a \z y \ 2 - a(l - a) \z x \ 2 

on the compact domain where a E [0, 1] and the three lengths are all well- 
defined, so that if a = t\/t then 



ti 



+ 



*2 



t 



t 

ht 2 



E. 



Let u be the point on the geodesic between x and z that is a of the way from 
x to z, so that u x = az x and u z = (1 — a)x z . Apply Eq. (1.7), to rewrite 
\ x v\ 2 j \yx\ 2 and \z y \ 2 in terms of |y„|, |ua,| and |u z |, and then express these in 
terms of \z x \ to obtain E(x, y, z; a) — |y„| 2 1 + o(a(l — a) \z x \ 2 ^j . 

Choosing S and the coefficient of t 1 / 2 ^ 8 wisely, E is bounded below by 
|y«| 2 [l ~ Ca(l — ct)t e ] where C is small enough so this is always positive. Also 
bound dy by ^1 + D |y u | 2 ^ dy u for some D > 0, so 



H(h) * H(t 2 )(x, z)= J (2 7 ri 1 )-"/ 2 (2 7 ri 2 )-" 
< (2irt)- n ' 2 e-^ f (27rtit 2 /t) 



/ 2 , 



.|-y| 2 _l^| 2 



2t l 



2t 2 dy 



i/2. 



M^L[l-Ca(l-a)t«] 

= H(a;, z;t)[l - Ca(l - a)t e ]^ n/2 (l + Da(l - a)f) n/2 . 



l + D\y u \ 



dy u 



From here each estimate in Eq. 4.2 is immediate. In fact, this shows the bound 
could be strengthened, replacing the exponential factor by 1 + 0(£ £ ), but the 
weaker form is sufficient and more convenient later. □ 



Lemma 4.2 Fort > t\ > bounded, K(t\) — K(x,y,ip X} ip y ;ti) in Eq. (3.1), 
e and \-\ t as in §2.3, and F E E x , y , 



\K{x,y-M)F 



pi - A t i /n ) ip 



F\.[H(x,y,ti) + 0(ti 



\FK(x,y;ti) 
and iff(x,tp x ) E r(A'TM). 



FK(x.v:h)\. = e ^) \F 



F\.[H{x,yM) + 0{tx)] 



hm K(t)*f = f. 
t->o 



(4.3) 
(4.4) 



Proof: For Eq. (4.3), notice that K can be written as (P + L) H, where L is a 
sum of products of terms which are either rank 2 and degree at most 1 (namely, 
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ei 1 2 Ricci )c! xV+ l 2 _ ^ or _ i R km jix m x : >ip k -^ T ) or a multiple of h and degree at 

most 2 (namely, -f (j^, R[ip, -0] ^)). The result follows from Eq. (2.5). 

For Eq. (4.4) let R be the the set of x for which K (x, y; t) is nonzero for 
a given y. Notice K(x,y^ x ,^ y ;t) = He i ^*-^)[l + 0{\y x \+t)], and, if / is 
smooth, f(y,tp y ) = f(x, ip y ) + 0(|y x |), so 



[F(f) */] (a;,Vx) = / <f> K(x,y,ip x ,ip y ;t)f(y,ipy)dip y dy 

■ (2^)-»/2 e -^#+i(.,^-^> ^) + 0( | y!t | + t)] dp#B ^ 

= / {2Ttt)- n l 2 e-^[f(x^ x ) + 0(\y x \+t)]dy x 

JR 

= f(x,^ x ) + o(t 1 / 2 ). 



□ 



4.2 The Norm on Kernels 

Error terms arising in the following sections will typically be kernels of the 
form 

[Fi (x, y; t) H(x, y; t) + F 2 {x, y; t)] P{x, y) 

where F\,F 2 G E x _ y . The size of |Fj| t gives a rough measure of the size of these 
errors, but more precision requires additional details on the size of F 2 . 

Proposition 4.1 Given e as in §2.3, there exists a D > such that, defining 
\\L\\ t for L G E Xi y to be the least number such that 

\L\ t <\\L\\ t [H(t)+Dt], (4.5) 

this norm satisfies, for K defined in Eq. (3.1) and for bounded and positive t\ 
and t 2 with t\ + t 2 = t, 



t 2 



\K{h)*L\ t <e^ J\\L\\ 

||L*F(i 2 )|| t <e ^ /n )||L|| ti (4.6) 

and 

||Li*L2|| t <0(i- 1 )||ii|| tl ||i2|| t2 (4.7) 

Proof: Observe that 

LTVT 2 {x,z) = J L x (x,yW v T7 2 {y,zy$ldy, 

and that | • | t is unchanged by parallel transport. For the first line of Eq. (4.6), 
apply this in the special case L\ = K{t\). Then, using Eq. (4.3) to define 
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bi,D > 0, applying property (c) of Lemma 2.1 with t 2 < t, using Eq. (4.2) to 
define b 2 , D 2 > 0, and using Eq. (4.1) to define D 3 , 



K(ti)*L\ t < J \K{x,yMW y L{y,z)^l\ t dy 

< e fclt i / " J \L\ t [H(x, y; h) + Dh] dy 



\\L\\ t2 J [H(y, z; t 2 ) + Dt 2 ] [H(x, y;h) + Dh] dy 



< e bltl/ " \\L\ t2 U b2tl/ "H(x 7 z; t) + Dt 2 + Dh + D 2 t 

+DD 3 ht 2 + DD 3 ht 2 + D 2 ht 2 Vol(M)] 

< e ^ +b ^ /n ||L|L H(x, z; t) + e"^" \\LL Dt 



2 



,bit{ /n 



\\L\\t 2 Dt 



D 3 (h+t 2 ) + £lt + DVo\(M)t 



<e bt i /n \\L\\ t2 [H(x,z;t) + Dt]. 

Here the D defining the norm in Eq. (4.5) is the constant coming from Eq (4.3), 
and b must satisfy b > bi + b 2 and bt\ /n > bit^ n + D 3 t + D Vol(M)i + D 2 t/D. 
Note such a b is guaranteed to exist since t]^™ times the right-hand side of this 
inequality is bounded, independent of t. The last line then follows from obvious 
estimates on e ( - b ~ bl ^ tl , using t = h + t 2 once more. The second version of 
Eq. (4.6) works in exactly the same fashion. 

For Eq. (4.7), for some C\ > coming from Eq. (2.6), C 2 > coming from 
Eq. (4.2) and C 3 > coming from Eq. (4.1) and (4.2) (and depending on D) 



\L x *L 2 \ t < J |Li(a:,»)q^L 2 (»^)q3»| t dy 

< Cit' 1 J \Li(x,y)\ tl \L^(y,z)\ t2 dy 



< Ci ||Li|| ti ||L 2 || t2 1- 1 J [H(x, y; h) + Dh][H(y, z; t 2 ) + Dt 2 ] dy 

< d ||Li|| ti ||L 2 || t2 r^CiHix, z-t) + C 3 Dt] 
<C 4 |Li| tl \\L 2 \\ t2 t- 1 [H(x,z;t) + Dt] 

for C 4 > Cimax(C2,C 3 ). □ 
With this definition, Eq. 3.5 of Proposition 3.1 immediately implies 
Corollary 4.1 If e is as in $2.3, K as in Eq. (3.1) and A as in Eq. (3.4) 

^-K = --A x K + O t (t e ) 
dt 2 v ' 

= - l -A y K + O t {f) (4.8) 



21 



where O t [f(t)] refers to an arbitrary kernel bounded in \\ ■ \\ t by a multiple of 
fit)- 

4.3 Approximate Semigroup Property 

Proposition 4.2 For K(t) = K(x 7 y,ip x ,ip y ;t) in Eq. (3.1), the norm \\ ■ \\ t as 
in Proposition 4-1 and t = t\ + t 2 > 

\\K( tl )*K(t 2 )-K(t)\\ t = 0{t 1+ *) (4.9) 



Proof: 

Let r be the injectivity radius of M, so that K(x, y; t) = if x and y are more 
than r apart. Suppose first that the spatial variables x and z implicit on the 
left-hand side of Eq. (4.9) are more than r/2 apart. By the triangle inequality, 

K(ti) * K(t 2 ) - K(t) < kTtT) *K(t 2 ) +\K(t)\.. The second summand is 

bounded by a constant times H(x,z;t), which, because the distance between 
x and z is bounded below, is exponentially damped for small t; in particular, 
it is 0(t 1+e ). The first summand is bounded by some non-negative power of 
t times H(ti) * Hfa)- Since within the integral represented by the * product 
every point y in the domain is a distance at least r/4 from either 1 or z, this 
integral is likewise exponentially damped, which proves Eq. (4.9) in this case. 
If x and z are less than r/2 apart, 



K(h) * K(t 2 ) - K(t) = / d{K(r) * K(t - t)} /dr dr 
Jo 

= ( 1 K{t) * K(t - t) - K{t) * K(t - t) dr 
Jo 

= / O r (T e ) * K(t -t) + K{t) * Ot-At - rf) dr 
Jo 

+ f 1 -A v [K(t)] * K(t - r)dr + K{t) * A y [K(t - r)] dr 
Jo 

= f 1 O t (T e ) + O t [(t-T) e ] dr 
Jo 

+ r f *d y [K{T)]K(t-T)dy-K{T)*d v [K(t-T)]dy 

JO Jdy 



dr 



where the third equation comes from Eq. (4.8), the fourth from Eq. (4.6), and 
boundary term from taking the formal adjoint of Aljir- The J„ indicates the 
integral in the y variable of the given forms along the boundary of the earlier 
region of integration, and * denotes the Hodge star operator. 

The first integral reduces to Ot{t 1+t ). For the boundary integral, the do- 
main of integration has bounded metric volume (bounded by the sum of the 
volumes of the spheres of the injectivity radius around x and z), so to bound 
the integral it suffices to bound the integrand. This integrand is of the form 
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a polynomial in t, t — r, r _1 , and (i — r) _1 times a smooth function of x,y,z 
times e~' yx ' / 2T ~ly*l / 2 (*~ T ). Each point on the boundary is a distance r from 
either x or z, and since x and z are less than r/2 apart, each point is a distance 
at least r/2 from both. Thus the Gaussian is exponentially damped in r and 
t — t, so the whole expression is bounded by a multiple of any power of t\ and 
t 2 . In particular it is 0(t 1+e ) — O t {t e ) . Eq. (4.9) then follows upon completing 
the now trivial r integral. □ 



5 Partitions 

5.1 Partitions and the Pointwise Limit 

For t > 0, a partition P of i is a sequence P = (ii, t 2 , ■ ■ ■ , t m ) with t — J2i U- 
If P is a partition of t and J" is a partition of t' , then the concatenation PP' 
is a partition of t + t'; if Pi is a partition of U for 1 < i < m, then P1P2 • • • P m 
is a refinement of P = (ti, . . . , t m ). Dehne |P| = maxt^. If P is a partition of t, 
and if is any kernel, define 

K* p = K(t 1 )*K(t 2 )*---*K(t m ) (5.1) 

Take {P : |P| < r} for r > to define open sets in the space of all partitions 
of a given t. Then it makes sense to define lim|p|^ of a function depending on 
t and P. 

Proposition 5.1 For K as defined in Eq. 3.1, fo{x,tp x ) piecewise continuous, 
and t > 0, 

f(t)=tim o K* p *f (5.2) 

is the unique solution f(x,tp x ;t) to the heat equation df/dt — —^^LdRf with 
/(0) = /„• 



Proof: Let f(t) be the solution to the heat equation with /(0) = fo (the 
dependence on x and i(j x is understood). The definition of K ensures that 
K(t) * f(s) is a smooth function of t with \im t ^o K(t) * f(s) = f(s). Then 
f{ti + t 2 ) — f(t 2 ) = t\ ALdR./^) + C(^i) (ah explicit and implicit constants in 
this proof depend on / and thus fo). Using Eq. 4.4 to expand about t\ = 0, 
and (4.8) 

* f(t 2 ) + O(tj) = tiA LdR /(t 2 ) + 0(t\ +f ) . 

t=o 

This uses the fact that for E(t) = O t {t e ), E(t) * / = 0{t e ). 

Thus there is a constant C such that \K{t\) * f(t 2 ) - f(h + 1 2 )\\ 00 < Ct\ +e 
for all t\ +t 2 < t. Since the operator K* depends smoothly on t and approaches 



K(h) * f(t 2 ) - f(t 2 ) = U 



dK 
~dt 
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the identity as t goes to we can also select C so that the operator norm of 
K(ti) is less than e ctl . 

Therefore decomposing P for each tk 6 P as P\ (tk)P2, and writing [K* p * /(0) 
as an appropriate telescoping sum, 



[K]* P *f{Q)-f{t) 



[x]* Pi * K(t k ) */(]>>) -/ ( + E ** ) 

\i>fe / \ i>fe / 



< ^ e c £*«= *^c4 +£ < Ce ct t \P\ e 



which converges to under refinement. 



□ 



5.2 The Cauchy Property 

Proposition 5.2 For sufficiently small t > 0, all partitions P of t, all refine- 
ments Q of P, and with the norm defined in Prop. 4-1 



\K* Q -K* p \\=0{t)\P\ 



(5.3) 



Proof: Eqs. (4.6), (4.7), and (4.9) give constants a,b,c such that for all t = 
h + t 2 positive and bounded and all L, L\,L 2 € E x . y 

\\Kih) * K{t 2 ) - K{t)\\ t <at 1+ ^ 



\K{H)*L\ t <e*i \\L\\ t2 
\\L*K{t 2 )\ t <e^ /n \L\\ ti 
\Li*L 2 \ t < ct' 1 jL^ \\L 2 \ 



1 2 



(5.4) 



The proof breaks down into three lemmas which respectively provide positive 
constants a\,b\, and b 2 such that for sufficiently small t (depending on cn,bi), 
and for every partition P of t 



\K* P -K(t)\\. <aii 1+e , 



(5.5) 



for every t\ + 1 2 = t, every pair of partitions Pi of t\ and P 2 of t 2 , and every 
L G Ex.v 



\K* Pl *L\l < e blt i n \\L\\ 



and for all partitions P of t and all refinements Q of P, 



(5.6) 



\K* Q -K* p \\ < 



ai e b ^ ln t\P\ e . 



(5.7) 
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Since t is bounded this implies Eq. (5.3). 



□ 



The proofs of Eqs. 5.5 and 5.7, provided in the lemmas below, rely on the 
following induction process: For any partition P of t, there exists a tk in P 
such that, writing P = P\{tk)P2, with Pi a partition of n and t = n + tk + t 2 , 
Ti < t/2. (the largest k so that n < t/2.) Assuming a given property holds for 
Pi and P 2 and proving it therefore holds for P will prove it holds in general, 
provided it also holds for the empty partition of and the trivial partition (i). 

Lemma 5.1 Eq. (5.5) holds. 

Proof: Begin with the special case P = (ti,t 2 ,t3). Several applications of 
Eq. (5.4) imply there is an a 2 depending on a and b such that 

\\K* P -K(t)\\ t < a 2 t 1+e (5.8) 

in this case. 

To begin the inductive argument, note Eq. (5.5) clearly holds when t = or 
P = (t). For the inductive step, write P = Pi(tk)P2, where Pi is a partition of 
Ti < t/2. Using the decomposition 

I ABC - A'BC'\ <\{A- A')BC'\ + \A'B(C - C')\ + \(A- A')B{C - C')\ , 



\K* P -K{t)\\ t 

= \\K(n) * K{t k ) * K{t 2 ) - K{t) + K* Pl * K{t k ) * K* P2 - K(n) * K{t k ) * K{ 

< \\K(ti) * K(t k ) * K(t 2 ) - K(t)\\ t + a 1 r 1 1 + £ e bt - / "+^ / " 
+ aie b ^ /n+bt l /n T 2 1+ * + C t- 1 a?e b ^ / "r 1 1+ V 2 1 + e 

< a 2 t 1+£ + aie 2bt " n 2- f -f Tl + aie 2U " n 2~ f -fr 2 + ca\ e ^ /n 2 - 2 ~ 2 H 1+2e 

< a 2 t 1+ < + ai2^e 2ht ' /n t 1+e [l + ca^" 2 " 2 ^ 6 ] . 

The second inequality uses tk < t and Tj < t/2, the third is just factoring 
plus some easy estimates. For this to be less than ait 1+e requires that f(ai,t) 
be greater than a 2 , where f(ai,t) = 01 — ai2 _c e 2b * e/ " [l + cai2 _2_2e t e ] . When 
t = this is true for sufficiently large ai , so by continuity there is a value of ai 
for which it is true for all t less than some bound. □ 



Lemma 5.2 Eq. (5.6) holds. 
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Proof: Using Eqs. (5.5) and (5.4) 

\\K"^ *L\\ t <\K{t 1 )*L\ t + ct- 1 a 1 t\ +e |L| ta 



< 



e" 1 + cait\ 



e/n 



\\L\\ t2 < e Ml ||L| t2 



In the second inequality use Eq. (5.4) and bound t\ by t, and in the third choose 



b\ large enough that e bltl/ bounds e btl ' + ca\t\ 



□ 



Lemma 5.3 Eq. (5.7) holds. 



Proof: Write Q = Q1Q2 • • • Q m where Qi ls a partition of U in P. Again proceed 
by induction on P, the empty case being trivial and the P — (t) case following 
from Eq. (5.5). For the induction step, write P = Pi(tfc)p2 with Pj a partition 
of Tj < t/2, and this time apply the decomposition 

\ABC - A'B'C'\ = \{A- A')BC'\ + \A'B(C - C)\ + \A'(B - B')C'\ 
+ \{A-A')B{C -C')\ 



to get 
\\K*Q — K* p \ 



\ K *Qi-Qk-i * K *Q k ^ K *Qk+i -Qn _ K Pi * K ( tk j * K *Pi I 



< e b ^ /n + b ^ /n + b ^ /n ai n \P 1 \ e + e b ^ / "+ b ^ /n + b ^ /n a 1 T 2 |P 2 | 



+ e 6ir 1 ' /B +6ir a "» ait l+ e + ca 2 t -l ebl tr +b2 T^ +b2 T^ TiT2 |p. 



e/r, 
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The first inequality follows from the inductive hypothesis Eq. (5.7), Eqs. (5.4), 
and Eq. (5.5). The second follows from n < t/2 and tk < t as before, along 
with tk < \P\, \Pi\ < |P|, and tit 2 < t 2 /4. We can clearly choose 62 big enough 
to get the third inequality. This quantity divided by t\P\ e is clearly less than 
aie 62 '" " when t = 0, so it must be for sufficiently small t (|P| is bounded by t 
of course). □ 



6 The Large Partition Limit 

Theorem 6.1 For sufficiently small, positive t the pointwise limit 

K°°(t) = lim K* p (6.1) 

|P|->0 
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exists in E XyV and is smooth in t. In fact, for all partitions and any < e < 1/2 
\\K°°{t)- K* p \\ t =0(t\P\ e ), (6.2) 

and, in particular, 

\\K°°(t)-K(t)\\ t = 0(t 1+e ). (6.3) 

Moreover, the limit K°°(t) is the heat kernel for the Laplace-de Rham operator, 
satisfying 

§- t K°° = - l -& L <mK°° (6.4) 
and for every form f e T(A*TM) 

limK°°(t)*f = f. (6.5) 



Proof: For a fixed t, The expression H(t) + Dt used to define the i-norm in 
Prop. 4.1 is bounded above and below, so the i-norm || • || t is bounded above and 
below by a multiple of the supremum norm. Therefore, fixing some sequence P n 
of refinements with lim Tl _ ! . 00 \P n \ =0 the sequence K* p ™ is Cauchy by Eq. (5.3) 
and thus converges to some K°°(t) continuous in t and the suppressed x and y. 
In particular 

\\K°°(t)-K* p "\\ t = 0(t)\P n \ e . 

If P is any other partition of t then for each P n with \P n \ < \P\ there is a P 1 
which is a common refinement of P and P„, and therefore by Eq. (5.3) again 
all the quantities K* p ' , K* p , K* p ™, and K°°(t) differ by 0(t) \P\ e , proving 
Eqs. (6.1), (6.2), and (6.3). 

If f(x,ip x ) is any smooth form on M, there exists for small enough t > a 
smooth form f{x,ij) x ;t) satisfying the heat equation 

and 

f(x,i> x ;0) = f{x,i> x ). 

By Eqs. (5.2) and (6.2) 

K°°(t)*f = f(t) 

proving Eq. (6.5). Thus as a distribution K°° is a solution to Eq. (6.4), and 
therefore, since the Laplace-de Rham heat equation is elliptic, by elliptic regu- 
larity ([Eva98]) it is in fact smooth in x, y, and t and satisfies Eq. (6.4). □ 



Corollary 6.1 In fact K°° is defined for all positive t by 

K°°(t)= lim [K( ti )]* P , (6.6) 

|P|->0 

and is the heat kernel satisfying Eqs. (6.4) and (6.5). The above limit converges 
pointwise and, for fixed t, uniformly. 



27 



Proof: Suppose Theorem 6.1 applies for all positive t less than some T > 0. 
For such t, K°° satisifies the semigroup property K°°{t\) * K 00 (t 2 ) = K°°(t) for 
t = t\ +<2, since the two kernels give the same operator on forms. Extending the 
definition of K°°{t) for all positive t to agree with (if°°)* p for P any partition 
of t fine enough that \P\ < T is therefore well-defined. This extended K°°{t) 
still satisfies Eqs. (6.4) and (6.5). 

To see that this extended K°°(t) satisfies Eq. (6.6), consider any partition 
Q with \Q\ < T/2, so that it can be written as a refinement of a partition P 
where each each tj satisfies T/2 <U <T. That is, Q = Q\ ■ ■ ■ Q m , where Qi is 
a partition of ij. Eq. (6.2) and \Qi\ < \Q\, guarantee 

\\K*Q< -K°°(U)\\ ti =0(U)\Q\ e . 

Since U is bounded above and below || • || t . is commensurate with the sup norm, 
so there is a b > such that 

Also, because K°° is the kernel of an elliptic operator there is a c > such 
that \\K°°(t) *f\\ oc <e ct 1/1^ . Induction on m will prove \\K*® - K 00 ^)^ < 
e at \Q\ e . For the induction step, apply the decomposition 

\AB - A'B'\ <\(A- A')B'\ + \A'(B - B')\ + \{A- A')(B - B')\ 

to get 

\\ K *Q -K^it)^ = jK* Q1 '" Qm ~ 1 * K* Qm — K°°(t — t m ) * K^ftm)^ 

< + e c (*-*".)6t m + Vol(M)e a( *- t - ) 6t m |Q| e j \Q\ e 

< e -(*-*»)[ e «=*m+6 ltm ] |Q| e 
<e at \Q\ e 

assuming in the second inequality that a > c and choosing b\ > b + Vol(M)bT e 
and in the third assuming that a > c+ b\. Eq. (6.6) follows. □ 



Remark 6.1 In [FS08] the authors prove a weaker result for a kernel K mq 
similar to K . There the result was that the limit taken along some sequence of 
refinements of P exists and agrees pointwise with the heat kernel. The stronger 
result depends on modifying the approximating kernel to agree with the heat 
kernel (as a distribution) to higher order in t. Both approximate kernels are 
consistent with time-slicing the imaginary-time path integral, as they amount to 
different interpolations between the fixed end-values. 
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Remark 6.2 The choice of K as the approximate kernel, and the basis for 
a discrete approximation to the path integral, is, as noted in the introduction, 
subject to some ambiguity in how the path integral specifies a discrete approxi- 
mation. It is natural to ask how the results of Theorem 6.1 and Corollary 6.1 
depend on the exact choice of K. Most straightforwardly, adding terms of the 
form 0(i 2 ) H or o(\y x \ 3 ^J H, and using the standard norm in place of \ ■ \ t , it 

is easy to follow along with the argument and see it all goes through unchanged. 
This would not be sufficient to prove the Gauss- Bonnet- Chern Theorem, but in 
most other respects seems as powerful a result. More subtly, adding to K a term 
of the form H\y Xl Ay x ) — ttrA] , where A is section of End(T x M) , does not 
change the limit, although the above argument in itself would not suffice and the 
convergence is slower. In particular Eq. (6.3) would no longer hold (even with 
a modified norm). Thus while K = He tx / & , which would appear to be the sim- 
plest choice, will converge in the large partition limit to the heat kernel for the 
Laplace- Beltrami operator, only K = iJ e * r / 12 +* Rlcci (y*>y*)/ 12 ff ers an analogue 
of Eq. (3.5) and hence Eqs. (6.2) and (6.3). 

7 The Gauss-Bonnet-Chern Theorem 

Theorem 7.1 

= (2^r n/2 Pfaff(i?) (7.1) 

top 

Remark 7.1 McKean and Singer [MS67] prove the integral of the matrix su- 
pertrace of the heat kernel, i.e. J § K°°(x,x,ij) x ,il) x ;t)dil) x dx, is equal to the 
signed sum of the Betti numbers, independent of t. Thus, the theorem implies 
the Gauss-Bonnet-Chern theorem, thereby completing a rigorous version of the 
path-integral proof of the latter. In fact, it implies a local version of the Gauss- 
Bonnet-Chern theorem: 

Jim strX°°(x, x; t) = (2ny n/2 Pfaff (R)(x). 

Proof: Theorem 6.1, in particular Eq. (6.3), estimates the heat kernel K°° 
as the approximation K plus an error term whose i-norm is 0{t l+<L ). That is, 
K°°{t) - K{t) = F\H + F 2 t, where \F t \ t = 0(t 1+e ) , so the supcrtrace of the 
error term is bounded by 

strf\(27rf)~™ /2 + strict. 

Lemma 2.2 implies str Fi = 0(i"/ 2+2e /™) so the supertrace of the error term is 
bounded by 0(i 2e / n ) and therefore goes to zero with t. On the other hand by 



limK°°(x,x, i> x ,i> x ;t) 
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Eq. (3.1) 

lim d> K(x,x,ip,ip;t) = lim(27rf) 




= (27T)-" /2 ^ e ^* 

= (27r)-" /2 PfafF(i?). 



8 Some conclusions 

The heuristic path integral argument leading to the Gauss-Bonnet-Chern and 
other index theorems has two key premises: first, the path integral for super- 
symmetric quantum mechanics, taken over free loops, gives the supertrace of 
the Laplace-de Rahm heat kernel; second, steepest descent applies to the path 
integral to give the Pfafhan of the curvature as its small-i limit. Proposition 3.1 
and Eq. 6.3 of Theorem 6.1 give the precise sense in which the kernel K ap- 
proximates the heat kernel for small t. The product K* p thus represents the 
approximate imaginary-time path integral for paths with fixed endpoints, in 
accord with Feynman's time-slicing approach to combining a choice of partition 
P with the small-f approximation. Corollary 6.1 to Theorem 6.1 then proves, 
that, with this precise definition, the path integral with fixed endpoints agrees 
with the heat kernel. This immediately leads to the first premise of the heuristic 
argument. For the second, Eq. 6.3 proves the steepest-descent approximation 
applies to this definition of the path integral, and Theorem 7.1 shows this ap- 
proximation agrees with the Pfaffian in the small- i limit. 

In short, lim| P |^ ^* P provides a rigorous definition for the path integral 
in which the path integral argument for GBC carries over directly. As such, it 
may serve as a template for rigorous definition path integrals for such immedi- 
ate generalizations as N = 1/2 supersymmetric quantum mechanics, which the 
authors believe requires only minor changes to the definition of K and should 
give a path integral proof of the Atiyah-Singer index theorem. More generally, 
it is not unreasonable to hope this approach will generalize to a wide range 
of cohomological field theories, because each of these depend, at least heuris- 
tically, on the same localization to the steepest-descent (or stationary-phase) 
approximation. 

Looked at another way, the heuristic arguments rely on an unstated assump- 
tion about the time-slicing prescription for the path integral; namely, the order 
of taking the small-i and small- |P| limits docs not matter. The application of 
Lemma 2.2 in the proof of Theorem 7.1 shows, in conjunction with the Cauchy 
property of Proposition 5.2, that, even for the path integral corresponding to 
based loops, the small-i limit of K°° agrees with the small-|P| limit of K * p (0). 

In their seminal work on the heat kernel McKean and Singer [MS67] referred 
to "fantastic cancellations" which must occur amongst lower-order terms in the 
asymptotic expansion for the heat kernel so that its supertrace agrees with 
that of the Pfaffian of the curvature, in keeping with the Gauss-Bonnet-Chern 



30 



theorem. In the present construction of the path integral, according to Eq. 6.3, 
the asymptotic expansion for the heat kernel begins with terms which on the 
diagonal are 0{t~ n / 2 ), yet the supertrace is 0(1). This happens because, first, 
the approximating kernel K already has this property, and second, the i-norm 
was devised so that the error terms, which are t (i 1+£ ), have supertrace in 
0(t e / n ). 

□ 
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